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We were interested in identifying genes in the dipteran flies 
Calliphora erythrocephala and Drosophila melanogaster with a role in 
oogenesis and early embryogenesis . A biochemical screen was used to 
complement the extensive genetic screens that have been performed to 
identify such genes in Drosophila. Radio- labelled cDNA probes were 
made using poly (A) $\sp+$ RNA preparations from staged Calliphora ovaries 
and embryos. These probes were used to isolate clones which were strongly 
expressed during oogenesis but not during embryogenesis. 

Four Calliphora genes which are absolutely "oogenesis-specif ic" in 
their expression pattern , as defined by our screening protocol, were 
identified. These are called A10B, B8I, C7F and GG7K. Three of these clones 
are expressed in the somatically derived follicle cells of the ovary and 
have been identified as being homologous to the Drosophila yolk protein 1 
(A10B and B8I) and a vitelline membrane protein (GG7K) . Interestingly, the 
yolk protein homologs are expressed in a specialized subset of follicle 
cells known as the border cells in Calliphora. The fourth gene, (C7F) is 
expressed in the nurse cells, the transcripts are translocated to the 
oocyte proper and maintained throughout the first four hours of 
embryogenesis. C7F is also expressed in late pupae and adult male flies. 

A number of Calliphora genes were identified which, although not 
oogenesis-specif ic, were more strongly expressed in the oocyte than the 
embryo. These were classified as "oogenesis -dif f erentials" . C7F and the 
Calliphora oogenesis-dif f erential genes were used to screen Drosophila cDNA 
and genomic libraries for homologs. Some characterization of these 
Drosophila homologs is described here. 

ME31B, a maternally expressed Drosophila gene from the 31B region of 
the second chromosome was isolated by directly screening Drosophila 
libraries with Calliphora cDNA probes . ME31B is expressed throughout 
oogenesis and the transcript is maintained in the mature egg until four 
hours after fertilization. The ME31B transcript is evenly distributed 
throughout the oocyte and egg. 

A 1.5 kb cDNA for ME31B was completely sequenced. Comparison of the 
coding sequence with a protein data bank allowed us to show that ME31B 
is a member of a family of NTP-dependent helicases. The possible mutant 
phenotype of ME31B is discussed. 
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Background. Profiling of gene expression in healthy and diseased renal 
tissue is important for elucidating the pathogenesis of renal diseases. 
Comprehensive information about the genes expressed in renal tissue is 
unavailable. The recently developed cDNA array hybridization methodology 
allows simultaneous monitoring of thousands of genes expressed renal 
tissue. Methods. Complex ( alpha - SUP 3 SUP 3 P) - labeled cDNA probes 
were prepared from histopathologically uninvolved remnants of nine renal 
tissues obtained by nephrectomy. Each probe was hybridized to a 
high-density array of 18,326 paired target genes. The radioactive 
hybridization signals by phosphor imager screens were quantitated by special 
software. Bioinf ormatics from public genomic databases were used to 
assign a chromosomal location of each expressed transcript and gene 
function. Cluster analysis was used to arrange genes according to the 
similarity in pattern of gene expression. Results, A total of 7563 
different gene transcripts was detected in the nine tissue samples. 
Approximately 870 of these genes were full-length mRNA human transcripts 
(HT) , and the remaining 6693 were expressed sequence tags (ESTs) . The 
full-length transcripts were classified by function of the gene product and 
were listed with information of their chromosomal positions. To allow a 
comparison between gene expression in clinical and experimental studies, 
the mouse genes with known similar function to the human counterpart were 
included in the bioinformatics analysis. Cluster analysis of 502 
full-length genes that are expressed in four or more renal tissues revealed 
more than 110 genes that are highly expressed in all the renal specimens. 
Conclusions. The presented data constitute a comprehensive preliminary 
transcriptional map of the adult human renal cortex. The information may 
serve as a resource for speeding up the discovery of genes underlying human 
renal disease. The integrated listing of the full-length expressed human 
and mouse genes is available through e-mail (AbdallaRifai@Brown.edu) . 
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Abstract: Currently there are over 1 000 000 human expressed sequence tag 
(EST) sequences available on the public database , representing 
perhaps 50-90% of all human genes. The cDNA microarray technique is a 
recently developed tool that exploits this wealth of information for 
the analysis of gene expression. In this method, DNA probes 
representing cDNA clones are arrayed onto a glass slide and 
interrogated with f luorescently labeled cDNA targets. The power: of 
the technology is the ability to perform a genome-wide expression 
profile of thousands of genes in one experiment. In our review we 
describe the principles of the microarray technology as applied to 
cancer research, summarize the literature on its use so far, and 
speculate on the future application of this powerful technique. 
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Abstract: Proline 3 -hydroxylase was purified from Streptomyces sp, strain 
TH1 , and its structural gene was cloned, The purified enzyme 
hydroxylated free L-proline to cis-3 -hydroxy-L- proline and showed 
properties of a 2 -oxoglutarate -dependent dioxygenase (H, Mori, T, 
Shibasaki, Y, Uosaki, K. Ochiai, and A, Ozaki, Appl, Environ, 
Microbiol. 62:1903-1907, 1996), The molecular mass of the purified 
enzyme was 35 kDa as determined by sodium dodecyl 

sulf ate-polyacrylamide gel electrophoresis, The isoelectric point of 
the enzyme was 4.3. The optimal pH and temperature were 7.0 and 35 
degrees C, respectively, The K-m values were 0.56 and O.ll mM for 
L-proline and 2 -oxoglutarate, respectively, The k(cat) value of 
hydroxylation was 3.2 s(-l). Determined N-terminal and internal amino 
acid sequences of the purified protein were not found in the SwissProt 
protein database , A DNA fragment of 74 bp was amplified by PCR with 
degenerate primers based on the determined N-terminal amino acid 
sequence, With this fragment as a template , a digoxigenin- labeled 
N-terminal probe was synthesized by PCR, A 6.5-kbp chromosome fragment 
was cloned by colony hybridization with the labeled probe , The 
determined DNA sequence of the cloned fragment revealed a 870 -bp open 
reading frame (ORF 3) , encoding a protein of 290 amino acids with a 
calculated molecular weight of 33,158, No sequence homolog was found in 
EMBL, GenBank, and DDBJ databases , ORF 3 was expressed in Escherichia 
coli DH1, Recombinants showed hydroxylating activity five times higher 
than that of the original bacterium, Streptomyces sp, strain TH1, It 
was concluded that the ORF 3 encodes functional proline 3 -hydroxylase . 
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Abstract: Mass spectrometric peptide mapping of proteins isolated by 

polyacrylamide gel electrophoresis is a rapid method for identifying 
proteins in sequence databases . A majority of tryptic peptide maps 
were found to contain pairs of peptide ion peaks separated by the 
molecular weight of the lysyl or arginyl residue. These peaks originate 
from amino acid sequence patterns such as Lys-Lys where trypsin 
has cleaved C-terminals to either one of the lysines. The peptide mass 
and the pattern define an N- or C-terminal sequence tag . Searching 
sequence databases by such a sequence tag results in only a 
moderate number of matches and significantly reduces the number of 
database matches when used in combination with a peptide mass map. Two 
N- or C-terminal sequence tags alone unambiguously identify a protein 
in most cases. The technique discussed here is simple, does not require 
additional measurements, and increases the percentage of protein 
samples that can be identified by their mass maps alone. N-Terminal 
peptide sequence tags for database searching can also be generated 
by manual one- step Edman degradation of the unseparated peptide 
mixture . 
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Abstract: In recent years the application of DNA typing information to 

criminal investigations has gained widespread acceptance. The primary 
method currently in use relies on length variation of DNA restriction 
fragments between individuals. These variations are identified using 
variable number tandem repeat (VNTR) DNA probes . As this technology 
becomes more widely used, it is crucial that scientifically valid 
methods of interpreting the significance of a DNA typing result be 
adopted. The method chosen should not only give a reliable 
approximation of the statistical likelihood of a particular RFLP 
pattern occurring, but should also be easy to present and for the 
court to understand. In this manuscript five methods of calculating a 
frequency of occurrence of a RFLP pattern will be presented (fixed 
bin genotype, floating bin phenotype, floating bin genotype, National 
Research Council (NRC) method using fixed bins and the NRC method using 
floating bins) . The calculations discussed here demonstrated that the 
fixed bin genotype method produces a frequency very similar to that 
obtained from floating bin phenotypes. In addition, regardless of the 
method chosen or the database size, the frequency of any particular 
banding pattern in the population over several loci was found to be 
very rare. 
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